Gradient-based Sampling: An Adaptive Importance Sampling for Least-squares
Abstract
In modern data analysis, random sampling is an efficient and widely used strategy for overcoming the computational difficulties posed by large sample sizes. Previous studies sampled according to the input data but independently of the response variable; however, the response variable may also be informative for sampling. In this paper we propose an adaptive sampling method, called gradient-based sampling, that depends on both the input data and the output, for fast solution of least-squares (LS) problems. We draw data points from the full data by random sampling according to their gradient values. This sampling is computationally cheap: the running time for computing the sampling probabilities is reduced to O(nd), where n is the full sample size and d is the dimension of the input. Theoretically, we establish an error bound for general importance sampling with respect to the LS solution from the full data. The result indicates improved performance for our gradient-based sampling. Experiments on synthetic and real data sets show empirically that gradient-based sampling has a clear advantage over existing sampling methods in both statistical efficiency and computational savings.
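The abstract does not spell out the procedure, so the following is only a minimal sketch of one way such a scheme could look, not the authors' implementation. It assumes a pilot estimate fitted on a small uniform subsample, sampling probabilities proportional to the norm of each point's least-squares gradient at that pilot estimate, sampling with replacement, and inverse-probability weights in the final fit. The function name `gradient_based_sampling_ls` and the parameters `r`, `r0`, and `rng` are hypothetical.

```python
import numpy as np

def gradient_based_sampling_ls(X, y, r, r0, rng):
    """Sketch of importance-sampled least squares (assumed procedure).

    X: (n, d) inputs, y: (n,) responses, r: subsample size,
    r0: pilot subsample size, rng: numpy random Generator.
    """
    n, d = X.shape

    # Assumption: a pilot estimate from a small uniform subsample.
    pilot_idx = rng.choice(n, size=r0, replace=False)
    beta0, *_ = np.linalg.lstsq(X[pilot_idx], y[pilot_idx], rcond=None)

    # Gradient of 0.5 * (y_i - x_i^T beta)^2 at beta0 is -(y_i - x_i^T beta0) x_i,
    # so its norm is |residual_i| * ||x_i||. This pass costs O(nd).
    resid = y - X @ beta0
    grad_norm = np.abs(resid) * np.linalg.norm(X, axis=1)

    total = grad_norm.sum()
    if total > 0:
        probs = grad_norm / total
    else:
        # Degenerate case (all gradients zero): fall back to uniform sampling.
        probs = np.full(n, 1.0 / n)

    # Assumption: draw r points with replacement according to probs.
    idx = rng.choice(n, size=r, replace=True, p=probs)

    # Inverse-probability weights offset the non-uniform sampling.
    w = 1.0 / (r * probs[idx])
    sw = np.sqrt(w)

    # Weighted least squares on the subsample via row scaling.
    beta, *_ = np.linalg.lstsq(X[idx] * sw[:, None], y[idx] * sw, rcond=None)
    return beta, probs
```

Because the probabilities depend on the residuals, they use the response as well as the inputs, which is the property the abstract emphasizes. Under these assumptions, a call such as `gradient_based_sampling_ls(X, y, r=500, r0=100, rng=np.random.default_rng(0))` returns the subsample estimate together with the sampling probabilities.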
Similar resources
Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm
We obtain an improved finite-sample guarantee on the linear convergence of stochastic gradient descent for smooth and strongly convex objectives, improving from a quadratic dependence on the conditioning (L/μ) (where L is a bound on the smoothness and μ on the strong convexity) to a linear dependence on L/μ. Furthermore, we show how reweighting the sampling distribution (i.e. importance samplin...
Least Squares Importance Sampling for Libor Market Models
A recently introduced Importance Sampling strategy based on a least squares optimization is applied to the Monte Carlo simulation of Libor Market Models. Such Least Squares Importance Sampling (LSIS) allows the automatic optimization of the sampling distribution within a trial class by means of a quick presimulation algorithm of straightforward implementation. With several numerical examples we...
Using MCMC Sampling to Calibrate a Computer Model of a Geothermal Field
We take a Bayesian approach to the calibration of an eight-parameter model of a geothermal field, analyzing measured well-test data. The posterior distribution over parameters for each of three scenarios, using different training data subsets, is explored using Markov chain Monte Carlo sampling. A novel parallel rejection algorithm is used to reduce computation time. Comparison across scenario...
Mixed-signal gradient flow bearing estimation
A mixed-signal architecture for estimating the 3-D direction cosines of a broadband traveling wave impinging on an array of four sensors is presented. The architecture implements gradient flow, which converts the problem of resolving time delays between sensor observations into the problem of estimating relative amplitudes of spatial and temporal derivatives over the array. Direction cosines of...
Importance-weighted least-squares probabilistic classifier for covariate shift adaptation with application to human activity recognition
Human activity recognition from accelerometric data (e.g., obtained by smart phones) is gathering a great deal of attention since it can be used for various purposes such as remote health-care. However, since collecting labeled data is bothersome for new users, it is desirable to utilize data obtained from existing users. In this paper, we formulate this adaptation problem as learning under cov...
Publication date: 2016